Skip to content

[ComfyUI]: ComfyUI integration#1113

Open
fhfuih wants to merge 31 commits intovllm-project:mainfrom
fhfuih:comfyui
Open

[ComfyUI]: ComfyUI integration#1113
fhfuih wants to merge 31 commits intovllm-project:mainfrom
fhfuih:comfyui

Conversation

@fhfuih
Copy link
Contributor

@fhfuih fhfuih commented Jan 30, 2026

Signed-off-by: Huang, Zeyu 11222265+fhfuih@users.noreply.github.com

Purpose

Design a one-in-all ComfyUI Integration for vLLM-Omni.

Close #900 (discussion about the UI design can go there)

Draft progress

Features I have experimented:

(This section is also added to plugin README)

The following features are tested:

  • Single-node workflows for
    • Multimodal Comprehension (e.g., Qwen Omni, BAGEL)
    • Text-to-Image Generation (e.g., Qwen-Image)
    • Image-to-Image Generation (e.g., Qwen-Image-Edit)
    • TTS (e.g., Qwen TTS, including VoiceDesign, VoiceClone, CustomVoice)

The following features are not currently tested. They will be tested in the future, and the READMEs will be updated accordingly

  • Multi-node workflow that connects multiple model services together.

Release Note

  • Officially support ComfyUI via a plugin at apps/ComfyUI-vLLM-Omni. Please check out the README in this folder for installation instructions.

Test Plan

No test for now. The test is difficult to add due to the following reasons:

  1. the source code imports ComfyUI internal files. This is possible in runtime because the source code itself will be placed in ComfyUI internal directory. However, the test files would need to mock them.
  2. Ideal test cases should run a real vllm service with mocked AsyncOmni in a subprocess. This is difficult to achieve, and may be introduced in another PR later.

For now, we rely on the existing entrypoint API tests to ensure that the API doesn't change.

The tests described above are WIP in my other branch https://github.com/fhfuih/vllm-omni/tree/comfyui-test. I will create another PR when it is ready.

Test Result

N/A


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. TODO: Will add later
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft.

Screenshots:

SAVE_20260206_114731
SAVE_20260206_114735
SAVE_20260206_114738
SAVE_20260206_114743

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

@fhfuih fhfuih changed the title [ComfyUI]: ComfyUI integration for image generation [ComfyUI]: ComfyUI integration Jan 30, 2026
@fhfuih fhfuih force-pushed the comfyui branch 5 times, most recently from 1890455 to 894dff2 Compare February 2, 2026 03:03
@david6666666 david6666666 self-requested a review February 4, 2026 02:34
@david6666666
Copy link
Collaborator

should we add follow features:

  • LoRA
  • Combinations of different outputs, such as image generation + image editing

etc...

fhfuih added 15 commits February 6, 2026 03:34
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Copilot AI review requested due to automatic review settings February 6, 2026 09:14
Copy link

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ff46ca8033

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
@david6666666
Copy link
Collaborator

@wtomin @SamitHuang @ZJY0516 ptal thx

@Gaohan123 Gaohan123 added the ready label to trigger buildkite CI label Feb 6, 2026
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR introduces a comprehensive ComfyUI integration for vLLM-Omni, enabling visual workflow-based inference for multimodal AI models through ComfyUI's node system. The integration provides nodes for image generation, multimodal comprehension, and text-to-speech tasks, supporting both single-stage and multi-stage model pipelines with configurable sampling parameters.

Changes:

  • Adds ComfyUI custom nodes for vLLM-Omni online serving API
  • Implements API client with support for image generation, editing, comprehension, and TTS
  • Provides sampling parameter nodes for autoregression and diffusion stages
  • Includes documentation, example workflows, and CI/CD workflows for publishing to ComfyUI registry

Reviewed changes

Copilot reviewed 30 out of 36 changed files in this pull request and generated 12 comments.

Show a summary per file
File Description
apps/ComfyUI-vLLM-Omni/__init__.py Plugin entry point defining node mappings and display names
apps/ComfyUI-vLLM-Omni/vllm_omni/nodes.py Core node implementations for generation and sampling parameters
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/api_client.py Async HTTP client for vLLM-Omni API endpoints
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/format.py Format conversion utilities for images, video, and audio
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/validators.py Validation logic for model specs and sampling parameters
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/models.py Model pipeline specifications and payload preprocessors
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/logger.py Logging configuration with base64 redaction
apps/ComfyUI-vLLM-Omni/vllm_omni/utils/types.py Type definitions for audio formats and model specifications
apps/ComfyUI-vLLM-Omni/web/main.js Frontend extension (mostly commented out)
apps/ComfyUI-vLLM-Omni/web/utils.js Multiline text widget utilities
apps/ComfyUI-vLLM-Omni/pyproject.toml Package configuration and metadata
apps/ComfyUI-vLLM-Omni/README.md User-facing documentation and quickstart guide
apps/ComfyUI-vLLM-Omni/LICENSE Apache 2.0 license
tests/comfyui/test_example.py Basic smoke test for node instantiation
tests/comfyui/conftest.py Test configuration for path setup
.github/workflows/comfyui-validate.yml CI workflow for backward compatibility validation
.github/workflows/comfyui-publish.yml CI workflow for publishing to ComfyUI registry
.github/workflows/build_wheel.yml Updated to exclude apps directory from build triggers
docs/features/comfyui.md Feature documentation for the integration
docs/.nav.yml Added ComfyUI to documentation navigation
.gitignore Allows example workflow JSON files

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
@fhfuih
Copy link
Contributor Author

fhfuih commented Feb 6, 2026

Combinations of different outputs, such as image generation + image editing

@david6666666 What do you mean by "Combinations of different outputs"? In the current design, the "Generate Image" node can handle both image generation and image editing. Depending on whether there is an input image, it routes to the correct API endpoint with correct payload. Is this what you mean?

What I'm looking for is that the generated image can be connected to another API, v1/images/edit, for image editing, similar to a workflow.

Ah yes, we have had this discussion today, and now the readme and this PR have added a notice that connecting multiple model services are not tested. I can help test in the future and add relevant documentation and example workflows.

LoRA

And LoRA as well!

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
@david6666666
Copy link
Collaborator

LGTM, look forward to follow pr

@hsliuustc0106
Copy link
Collaborator

in the follow-up PR, please test Hunyuan Image 3.0 instruct model, we are going to use this model for demonstration and blogpost

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 22 out of 27 changed files in this pull request and generated 18 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready label to trigger buildkite CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[RFC]: ComfyUI Integration Design

4 participants